Speech Logarithm Filtering Equal − Loudness Curve Power −
نویسندگان
چکیده
Most speech parameter estimation techniques are easily innuenced by the frequency response of the communication channel. We have developed a technique that is more robust to such steady-state spectral factors in speech. The approach is conceptually simple and computationally eecient. The new method is described, and experimental results are reported, showing a signiicant advantage for the proposed method.
منابع مشابه
Effectiveness of Near-End Speech Enhancement Under Equal-Loudness and Equal-Level Constraints
Most recently proposed near-end speech enhancement methods have been evaluated with the overall power (RMS) of the speech held constant. While significant intelligibility gains have been reported in various noisy conditions, an equal-RMS constraint may lead to enhancement solutions that increase the loudness of the original speech. Comparable effects might be produced simply by increasing the p...
متن کاملSpeaker Identification System based on PLP Coefficients and Artificial Neural Network
Speaker recognition is the process of automatically recognizing who is speaking on the basis of individual information included in speech waves. Feature extraction for speech recognition is a subject of a major interest today; different features have been investigated in speech recognition systems. The perceptual linear predictive PLP: this technique uses three concepts from the psychophysics o...
متن کاملPerceptual linear predictive (PLP) analysis of speech.
A new technique for the analysis of speech, the perceptual linear predictive (PLP) technique, is presented and examined. This technique uses three concepts from the psychophysics of hearing to derive an estimate of the auditory spectrum: (1) the critical-band spectral resolution, (2) the equal-loudness curve, and (3) the intensity-loudness power law. The auditory spectrum is then approximated b...
متن کاملSpeech Enhancement Using Gaussian Mixture Models, Explicit Bayesian Estimation and Wiener Filtering
Gaussian Mixture Models (GMMs) of power spectral densities of speech and noise are used with explicit Bayesian estimations in Wiener filtering of noisy speech. No assumption is made on the nature or stationarity of the noise. No voice activity detection (VAD) or any other means is employed to estimate the input SNR. The GMM mean vectors are used to form sets of over-determined system of equatio...
متن کاملCan modified casual speech reach the intelligibility of clear speech?
Clear speech is a speaking style adopted by speakers in an attempt to maximize the clarity of their speech and is proven to be more intelligible than casual speech. This work focuses on modifying casual speech to sound as intelligible as clear speech. First, we examine the role of speaking rate for intelligibility. Clear and casual speech signals are time-scale stretched, matching the average d...
متن کامل